cltk's Repositories
100 repositories
alatinparser
ALP (A Latin Parser) is a syntactic parser for a small subset of classical Latin.
β 3
π Public
ang_models_cltk
No description
β 6
π Public
annotations
A tool for annotating texts using Draft.js
β 13
π Public
arabic_morphology_quranic-corpus
No description
β 2
π Public
arabic_text_perseus
corpus for Classical arabic
β 1
π Public
arabic_text_quranic_corpus
No description
β 0
π Public
bengali_text_wikisource
No description
β 3
π Public
capitains_corpora_converter
Converts CapiTainS-based Repository ( http://capitains.github.io ) to JSON for CLTK
β 0
π Public
capitains_text_corpora
Processed docs from capitains_corpora_converter
β 1
π Public
chinese_text_cbeta_01
Chinese Buddhist scriptures from CBETA
β 0
π Public
chinese_text_cbeta_02
Chinese Buddhist scriptures from CBETA
β 1
π Public
chinese_text_cbeta_indices
Indices to the CBETA corpus
β 4
π Public
chinese_text_cbeta_taf_xml
No description
β 0
π Public
chinese_text_cbeta_txt
No description
β 0
π Public
chinese_text_sheffield
Texts from the Sheffield Corpus of Chinese
β 0
π Public
chinese_text_wikisource
No description
β 0
π Public
classical_arabic_models
Statistical models for Classical Arabic
β 0
π Public
cltk
The Classical Language Toolkit
β 880
π Public
cltk.github.io
Static website for CLTK organization, built with Jekyll
β 1
π Public
cltkv1
Experimental repo for new API CLTK
β 1
π Public
π¦ Archived
cltk_api
RESTful API for the CLTK
β 13
π Public
π¦ Archived
cltk_api_v2
No description
β 1
π Public
cltk_community_api
No description
β 1
π Public
cltk_docker
Docker script for cltk
β 6
π Public
cltk_frontend
Reading environment connecting to API from cltk/cltk_api repo
β 20
π Public
π¦ Archived
cltk_grc_liddell_scott_intermediate
No description
β 1
π Public
cltk_lat_lewis_elementary_lexicon
No description
β 0
π Public
cltk_non_zoega_dictionary
No description
β 0
π Public
cltk_vagrant
Vagrant and other bootstrap methods for CLTK core and CLTK API
β 0
π Public
coptic_text_scriptorium
Public repository for Coptic SCRIPTORIUM Corpora Releases
β 0
π Public
csel_openphilology_corpus
CSEL orpus based on https://github.com/OpenGreekAndLatin/csel-dev/
β 0
π Public
english_texts_wikisource
No description
β 3
π Public
enm_models_cltk
Models for Middle English provided by CLTK
β 1
π Public
escriptorium-deploy
Scripts to deploy the eScriptorium OCR system
β 2
π Public
extras
Place for modules left out of transition to v1.0
β 0
π Public
First1KGreek
XML files for the works in the First Thousand Years of Greek Project.
β 3
π Public
french_lexicon_cltk
Old French lexicon from wikisource.org
β 1
π Public
french_text_wikisource
Collected texts from wikisource.org
β 2
π Public
fro_models_cltk
No description
β 0
π Public
germanic_models_cltk
No description
β 1
π Public
gmh_models_cltk
Stored data for tagging Middle High German
β 1
π Public
gml_models_cltk
No description
β 1
π Public
grc_models_cltk
Trained taggers, tokenizers, etc. for the CLTK
β 9
π Public
grc_software_tlgu
Utility for converting TLG & PHI corpora to Unicode
β 7
π Public
grc_text_perseus
Collected Greek files from the Perseus Digital Library
β 11
π Public
grc_text_tesserae
Plaintext files with Ancient Greek texts from the Tesserae Project
β 6
π Public
greek_lexica_perseus
Lexica and lemmata for the Ancient Greek language, from various sources
β 20
π Public
greek_ner_v1
No description
β 0
π Public
greek_pos_edit_xenophon_anabasis
A humanβeditable version of a POSβtagged text of Xenophon's Anabasis
β 2
π Public
greek_proper_names_cltk
A list of ~144K Classical Greek proper names
β 4
π Public
greek_software_tlgu_python
A python wrapper for greek_software_tlgu
β 1
π Public
greek_text_lacus_curtius
Collected Greek Texts from Lacus Curtius
β 0
π Public
greek_training_set_sentence_cltk
Training sets and tokenizer for the Classical Greek language, for use with CLTK
β 5
π Public
greek_treebank_perseus
Greek treebank from the Perseus Digital Library
β 12
π Public
greek_word2vec_cltk
Greek Word2Vec models
β 6
π Public
gujarati_text_wikisource
Collected Gujarati texts from wikisource.org
β 1
π Public
hebrew_text_sefaria
Structured Jewish texts and metadata exported from Sefaria's database.
β 2
π Public
hindi_text_ltrc
Corpus of Raw text for Classical Hindi
β 3
π Public
iswoc-treebank
Official releases of the ISWOC treebank
β 0
π Public
javanese_text_gretil
extracted the old javanese text.
β 0
π Public
lapos
Fork of the Lookahead Part-Of-Speech (Lapos) Tagger
β 5
π Public
latin-macronizer
Script to automatically mark long vowels in Latin texts. Also optionally performs conversion of u to v and i to j.
β 1
π Public
latin_lexica_perseus
Lexica and lemmata for the Latin language, from various sources
β 6
π Public
latin_pos_lemmata_cltk
No description
β 11
π Public
latin_proper_names_cltk
A list of ~40K Classical Latin proper names
β 8
π Public
latin_text_antique_digiliblt
Antique Latin Corpus from digilibLT
β 2
π Public
latin_text_corpus_grammaticorum_latinorum
Collected Latin Data from Corpus Grammaticorum Latinorum
β 4
π Public
latin_text_lacus_curtius
Collected Latin files from LacusCurtius
β 2
π Public
latin_text_poeti_ditalia
Corpus for Italian Poetry in Latin
β 1
π Public
latin_training_set_sentence_cltk
Training sets and tokenizer for the Latin language, for use with CLTK
β 4
π Public
latin_treebank_index_thomisticus
Treebank of the works of Thomas Aquinas
β 0
π Public
latin_treebank_perseus
Latin treebank from the Perseus Digital Library
β 5
π Public
latin_word2vec_cltk
Latin Word2Vec models
β 2
π Public
lat_models_cltk
Trained taggers, tokenizers, etc. for the CLTK
β 10
π Public
lat_text_latin_library
Collected files from thelatinlibrary.com
β 22
π Public
lat_text_perseus
Collected Latin files from the Perseus Digital Library
β 13
π Public
lat_text_tesserae
Plaintext files with Latin texts from the Tesserae Project
β 8
π Public
malayalam_text_gretil
contains malayalam_text
β 0
π Public
marathi_text_wikisource
No description
β 7
π Public
middle_english_text_cmepv
Texts from Corpus of Middle English Prose and Verse
β 2
π Public
middle_high_german_texts
No description
β 0
π Public
morpheus
Morpheus parser
β 1
π Public
multilingual_treebank_proiel
Official releases of the PROIEL treebank of ancient Indo-European languages
β 2
π Public
non_models_cltk
Trained tagger for Old Norse
β 0
π Public
non_texts
Classical Texts from Old Norse Literature
β 0
π Public
old-norse-lemmatizer
No description
β 2
π Public
old_church_slavonic_ccmh
No description
β 1
π Public
old_english_text_sacred_texts
No description
β 2
π Public
old_norse_runes_corpus
No description
β 2
π Public
old_norse_texts_heimskringla
Texts retrieved from Heimskrinla.no for easy use with cltk!
β 2
π Public
old_norse_text_perseus
No description
β 2
π Public
old_swedish_texts
No description
β 0
π Public
pali_models_cltk
No description
β 0
π Public
pali_texts_gretil
No description
β 1
π Public
pali_text_ptr_tipitaka
Pali Tipitaka packaged with the Digital Pali Reader
β 3
π Public
prakrit_texts_gretil
No description
β 1
π Public
punjabi_text_gurban
Punjabi Files of Gurbani
β 4
π Public
sanskrit_parallel_gitasupersite
Parallel corpus
β 11
π Public
sanskrit_parallel_sacred_texts
This Repository contains parallel Sanskrit and English Documents.
β 8
π Public
sanskrit_pos_jnu_tagged
No description
β 2
π Public